# Visual Instruction Tuning
Llava MORE Llama 3 1 8B Finetuning
Apache-2.0
LLaVA-MORE is an enhanced version based on the LLaVA architecture, integrating LLaMA 3.1 as the language model, focusing on image-to-text tasks.
Image-to-Text
Transformers

L
aimagelab
215
9
Instructblip Flan T5 Xl 8bit Nf4
MIT
InstructBLIP is a vision-instruction-tuned version based on BLIP-2, combining visual and language processing capabilities to generate responses based on images and textual instructions.
Image-to-Text
Transformers English

I
benferns
20
0
Instructblip Flan T5 Xl 8bit Nf4
MIT
InstructBLIP is a vision instruction tuning model based on BLIP-2, using Flan-T5-xl as the language model, capable of generating descriptions based on images and text instructions.
Image-to-Text
Transformers English

I
Mediocreatmybest
22
0
Instructblip Flan T5 Xxl 8bit Nf4
MIT
InstructBLIP is the vision-instruction-tuned version of BLIP-2, combining vision and language models to generate descriptions or answer questions based on images and text instructions.
Image-to-Text
Transformers English

I
Mediocreatmybest
22
1
Instructblip Flan T5 Xl 8bit
MIT
InstructBLIP is the vision-instruction-tuned version of BLIP-2, based on the Flan-T5-xl language model, designed for image-to-text generation tasks.
Image-to-Text
Transformers English

I
Mediocreatmybest
18
1
Instructblip Vicuna 13b
Other
InstructBLIP is the visual instruction-tuned version of BLIP-2, based on the Vicuna-13b language model, designed for vision-language tasks.
Image-to-Text
Transformers English

I
Salesforce
1,251
42
Instructblip Flan T5 Xxl
MIT
InstructBLIP is the vision-instruction-tuned version of BLIP-2, capable of generating descriptions or answers based on images and text instructions
Image-to-Text
Transformers English

I
Salesforce
937
21
Instructblip Vicuna 7b
Other
InstructBLIP is a vision instruction-tuned version based on BLIP-2, using Vicuna-7B as the language model, focusing on vision-language tasks.
Image-to-Text
Transformers English

I
Salesforce
20.99k
91
Featured Recommended AI Models